Abstract

We prove ergodicity of a point process earthquake model combining the 
classical stress release model for primary shocks with the Hawkes model for 
aftershocks. 

Keywords: earthquakes, secondary earthquakes, point process, stochastic intensity, ergodicity, Harris chains, Foster criterion



1 Introduction 



The times of occurrence of earthquakes in a given area of seismic activity form a simple point process $N$ on the real line, where $N((a, b])$ is the number of shocks in the time interval $(a, b]$. In the present model, the dynamics governing the process will be expressed by the stochastic intensity $\lambda(t)$. In intuitive terms (to be made precise in the next section),

$$\lambda(t) = \lim_{h \downarrow 0} \frac{1}{h} P\bigl(N((t, t+h]) = 1 \mid \mathcal{F}_t\bigr),$$

where $\mathcal{F}_t$ is the sigma-field summarizing the available information at time $t$ (increasing with $t$). In the stress release model, for $t \ge 0$,



$$\lambda(t) = e^{X_0 + ct - \sum_{n=1}^{N((0,t])} Z_n},$$

where $c > 0$ and $\{Z_n\}_{n \ge 1}$ is an i.i.d. sequence of non-negative random variables with finite expectation, whereas $X_0$ is some real random variable. The process

$$X(t) = X_0 + ct - \sum_{n=1}^{N((0,t])} Z_n$$

is ergodic, and the reader is referred to [9] for a proof and the relevant results concerning a generalization of this particular model.

Another model of interest in seismology is the Hawkes branching process, where the stochastic intensity is

$$\lambda(t) = \nu(t) + \int_{(0,t]} h(t-s)\, N(ds),$$

where $h$ is a non-negative function, called the fertility rate, and $\nu$ is a non-negative integrable function. Such a point process appears in the specialized literature under the name ETAS (Epidemic Type After-Shock; [12]) and is used to model the aftershocks (see [7], p. 203). It is well known that the corresponding process "dies out" in finite time under the condition $\int_0^\infty h(t)\, dt < 1$ ([6]).

A model mixing stress release and Hawkes aftershocks is ([13])

$$\lambda(t) = e^{X_0 + ct - \sum_{n=1}^{N((0,t])} Z_n} + Y_0 e^{-\alpha t} + k \int_{(0,t]} e^{-\alpha(t-s)}\, N(ds),$$

where $\alpha > 0$. The positive constant $c$ is the rate at which the strain builds up. If there is a shock at time $t$, then the strain is relieved by the quantity $Z_{N(t)}$. Each shock (primary or secondary) at time $t$ generates aftershocks according to a Poisson process of intensity $s \mapsto k e^{-\alpha(s-t)}$, $s > t$. In this article, we give necessary and sufficient conditions of ergodicity for this model. We shall start with a precise mathematical description of it.

2 Description of the model 

Let $\varphi : (-\infty, \infty) \to [0, \infty)$ be a non-decreasing function such that $\lim_{x \to -\infty} \varphi(x) = 0$ and $\lim_{x \to \infty} \varphi(x) = \infty$. We operate under either one of the following assumptions: (a) the function $\varphi$ is strictly positive everywhere, or (b) it is equal to zero for all $x$ below some level and strictly increasing otherwise.

We are given 

(1) a Poisson field $\Pi$ of intensity 1 in the positive quadrant and

(2) an i.i.d. family of positive random variables $\{Z_n\}_{n \ge 1}$ with a finite mean,

and it is assumed that

(3) the Poisson field and the i.i.d. family are independent. 

The above Poisson field and i.i.d. family constitute the probabilistic basis of our 
model. 

We consider a simple point process $N$ with the following stochastic intensity:

$$\lambda(t) = \varphi\Bigl(X_0 + ct - \sum_{n=1}^{N(t)} Z_n\Bigr) + Y_0 e^{-\alpha t} + k \int_{(0,t]} e^{-\alpha(t-s)}\, dN(s), \tag{1}$$

where $N(t) := N((0,t])$, and where $X_0$, $Y_0$, $c$, $k$, and $\alpha$ are as in the introduction. This means that the point process is constructed recursively as

$$N(t) = \int_{(0,t]} \int_{\mathbb{R}} \mathbf{1}\{z \le \lambda(s-)\}\, \Pi(ds \times dz).$$

Defining

$$\mathcal{F}_t := \sigma\{X_0, Y_0;\ N(s), Z_{N(s)},\ s \le t\},$$

the process $\{\lambda(t)\}_{t \ge 0}$ is then the $\mathcal{F}_t$-stochastic intensity of $N$ in the sense of [1] (see

also [u, [ng). 

In the seismological interpretation, 

$$\lambda_1(t) = \varphi\Bigl(X_0 + ct - \sum_{n=1}^{N(t)} Z_n\Bigr) \tag{2}$$

is the stochastic intensity of the primary shocks, whereas 

$$\lambda_2(t) = Y_0 e^{-\alpha t} + k \int_{(0,t]} e^{-\alpha(t-s)}\, dN(s) \tag{3}$$

is the stochastic intensity of the aftershocks. 
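As an illustration only, the dynamics (1)-(3) can be simulated by thinning. The Python sketch below is not part of the construction used in the proofs: the function name `simulate_mixed_model`, the `window` parameter and the particular choices of $\varphi$ and of the law of $Z_n$ in the usage note are assumptions made for the example. It relies on the fact that, between shocks, $\varphi(X(t))$ is non-decreasing and $Y(t)$ is non-increasing, so that on a short window the intensity is dominated by a constant.

```python
import math
import random


def simulate_mixed_model(x0, y0, c, k, alpha, phi, draw_z, horizon,
                         window=0.1, seed=0):
    """Event times on (0, horizon] of a point process with intensity
    phi(X(t)) + Y(t), simulated by thinning (illustrative sketch)."""
    rng = random.Random(seed)
    t, x, y = 0.0, x0, y0
    events = []
    while t < horizon:
        span = min(window, horizon - t)
        # With no shock on [t, t + span], X grows at rate c and Y decays,
        # so phi(x + c * span) + y dominates the intensity on that window.
        bound = phi(x + c * span) + y
        wait = rng.expovariate(bound) if bound > 0.0 else math.inf
        step = min(wait, span)
        t += step
        x += c * step
        y *= math.exp(-alpha * step)
        if wait <= span and rng.random() * bound <= phi(x) + y:
            # Candidate accepted: a shock releases stress and excites aftershocks.
            x -= draw_z(rng)
            y += k
            events.append(t)
    return events
```

For instance, with the assumed choices $\varphi = \exp$, $c = 1$, $k = 0.5$, $\alpha = 1$ and unit-mean exponential $Z_n$, the call `simulate_mixed_model(0.0, 0.0, 1.0, 0.5, 1.0, math.exp, lambda rng: rng.expovariate(1.0), 200.0)` returns the list of shock times on $(0, 200]$.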

The problem is to find a necessary condition for the existence and uniqueness of the corresponding stationary process and for the convergence to the stationary distribution from any initial distribution of $X_0$ and $Y_0$, and to prove formally that this condition is also sufficient (under a further smoothness condition on the distribution of $Z_1$).

3 On the ergodicity condition 

Ergodicity will be proven in the case

$$\frac{k}{\alpha} < 1.$$

This section shows that this is indeed a natural (intuitive) condition and moreover that it is necessary if we seek only those solutions for which the steady-state average intensity $\lambda := E[\lambda(t)]$ satisfies $0 < \lambda < \infty$.

We therefore henceforth assume ergodicity. From now on we use the notation 

$$X(t) = X_0 + ct - \sum_{n=1}^{N(t)} Z_n$$

and

$$Y(t) = Y_0 e^{-\alpha t} + k \int_{(0,t]} e^{-\alpha(t-s)}\, dN(s).$$

The process $(X(t), Y(t))$, $t \ge 0$, is a time-homogeneous Markov process with initial value $(X_0, Y_0)$, and

$$\lambda(t) = \varphi(X(t)) + Y(t).$$

Further, ergodicity means, in particular, that there exists a stationary version of the process $(X(t), Y(t))$. For such a stationary version, let $\lambda_1 = E[\varphi(X(t))]$ and $\lambda_2 = E[Y(t)]$. Then $\lambda = \lambda_1 + \lambda_2$, so the finiteness of $\lambda$ implies that of $\lambda_1$ and of $\lambda_2$.
Observe that 

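$$\begin{aligned}
E[Y(t)] &= E[Y(0)] e^{-\alpha t} + k E\Bigl[\int_{(0,t]} e^{-\alpha(t-s)}\, N(ds)\Bigr] \\
&= E[Y(0)] e^{-\alpha t} + k \int_0^t e^{-\alpha(t-s)} \lambda\, ds \\
&= E[Y(0)] e^{-\alpha t} + \lambda \frac{k}{\alpha} \bigl(1 - e^{-\alpha t}\bigr),
\end{aligned}$$

where we used Campbell's formula. By stationarity, $E[Y(0)] = \lambda_2 = E[Y(t)]$ for all $t$, so the identity above forces $\lambda_2 = \lambda k / \alpha$. Then

$$\lambda = \lambda_1 + \lambda \frac{k}{\alpha} = \lambda_1 + \lambda_2.$$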

The supercritical case. Suppose, in view of contradiction, that $k/\alpha > 1$. The last equality then implies that $\lambda = \infty$, which we excluded, or that $\lambda = 0$, and then $\lambda_1 = E[\varphi(X(t))] = 0$. Since $\varphi(X(t)) \ge 0$, this implies $P(\varphi(X(t)) = 0) = 1$, that is $P(X(t) = -\infty) = 1$. Similarly $P(Y(t) = 0) = 1$.

The critical case. Suppose now, again in view of contradiction, that $k/\alpha = 1$. The last displayed equality implies then that $\lambda = \infty$ (excluded) or $\lambda_1 = 0$ and therefore $P(\varphi(X(t)) = 0) = 1$. Then

$$\lambda(t) = Y(0) e^{-\alpha t} + k \int_{(0,t]} e^{-\alpha(t-s)}\, N(ds).$$

We show that any point process $N$ with this stochastic intensity and with finite average intensity is necessarily null (with intensity equal to 0). Suppose that $\lambda > 0$. Clearly,

$$\begin{aligned}
P(N(\mathbb{R}_+) = 0) &= E\bigl[P(N(\mathbb{R}_+) = 0 \mid Y(0))\bigr] \\
&= E\Bigl[e^{-\int_0^\infty Y(0) e^{-\alpha t}\, dt}\Bigr] \\
&\ge e^{-E[Y(0)]/\alpha} = e^{-\lambda_2/\alpha} > 0
\end{aligned}$$

(the inequality is Jensen's), and therefore, since we assumed $\lambda < \infty$, we have that $P(N(\mathbb{R}_+) = 0) > 0$. Now, $\{N(\mathbb{R}_+) = 0\} \subseteq \theta_t \{N(\mathbb{R}_+) = 0\} = \{N([t, \infty)) = 0\}$. That is, $\{N(\mathbb{R}_+) = 0\}$ is expanded by the (ergodic) shift, and therefore it has probability 0 or 1. By the above, this probability must be 1. We conclude that $\lambda = 0$, a contradiction. Therefore in the critical case there is no solution except the trivial one (no earthquakes).



4 Explicit expressions for the average rates 

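In this section, we exhibit an interesting feature of the model. We assume here again ergodicity and the condition $0 < \lambda < \infty$. We continue to consider the model in the stationary regime. Write

$$\begin{aligned}
\varphi(X(t)) &= \varphi\Bigl(X(0) + t\Bigl(c - \frac{1}{t}\sum_{n=1}^{N(t)} Z_n\Bigr)\Bigr) \\
&= \varphi\bigl(X(0) + t\,(c - \lambda E[Z_1] + \varepsilon(t))\bigr),
\end{aligned}$$

where $\lim_{t \to \infty} \varepsilon(t) = 0$ a.s. Let $\Delta := c - \lambda E[Z_1]$. Let $\tau$ be the (a.s. finite) random time such that $t \ge \tau$ implies $|\varepsilon(t)| \le \frac{1}{2}|\Delta|$.

Suppose that $c - \lambda E[Z_1] > 0$. We have

$$E[\varphi(X(t))] \ge E\bigl[\varphi(X(t)) \mathbf{1}_{\{t \ge \tau\}}\bigr] \ge E\bigl[\varphi\bigl(X(0) + t \tfrac{1}{2}\Delta\bigr) \mathbf{1}_{\{t \ge \tau\}}\bigr].$$

But $\varphi(X(0) + t\frac{1}{2}\Delta) \mathbf{1}_{\{t \ge \tau\}} \uparrow \infty$ as $t \to \infty$ and therefore

$$\lambda_1 = E[\varphi(X(t))] \to \infty,$$

implying that $\lambda = \infty$, which is excluded.

Suppose that $c - \lambda E[Z_1] < 0$. We show that this is impossible. Here $\lim_{t \to \infty} \varphi(X(t)) = 0$ by a similar argument. We prove that $\lim_{t \uparrow \infty} E[\varphi(X(t))] = 0$, so that $\lambda_1 = 0$ and therefore $\lambda = 0$, which is impossible.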

For the proof that $\lim_{t \to \infty} E[\varphi(X(t))] = 0$ we can make use of the following lemma (in fact taking care of both situations when $c - \lambda E[Z_1] \ne 0$).

Lemma 1. If the stationary stochastic process $\{Z(t)\}_{t \ge 0}$ is such that it tends almost surely to a deterministic constant $c$ as $t \uparrow \infty$, then it is almost surely equal to this constant.

Proof. Fix $\varepsilon > 0$, and consider the set

$$C = \{\omega : Z(t, \omega) \in [c - \varepsilon, c + \varepsilon] \text{ for all } t \ge 0\}.$$

Then for all $a > 0$,

$$\theta_a C = \{\omega : Z(t, \omega) \in [c - \varepsilon, c + \varepsilon] \text{ for all } t \ge a\}.$$

But $\theta_a C \uparrow \Omega$ as $a \uparrow \infty$, and therefore $P(C) = P(\theta_a C) \uparrow 1$, so that $P(C) = 1$. Since this is true for all $\varepsilon > 0$,

$$P\{Z(t) = c\} = 1, \quad \text{for all } t \ge 0.$$

Therefore, necessarily

$$\lambda = \frac{c}{E[Z_1]}.$$

Therefore, in this model, the rate of occurrence of earthquakes is given by the physics of stress build-up (the constant $c$) and stress release ($E[Z_1]$), whereas the global rate is shared among primary and secondary earthquakes according to the physics of the aftershocks ($\alpha$ and $k$).
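The two identities of Sections 3 and 4, namely that the total rate equals $c/E[Z_1]$ and that the aftershock rate equals the total rate times $k/\alpha$, can be turned into a small numerical helper. The Python sketch below is only an illustration; the function name `stationary_rates` and its argument names are ours.

```python
def stationary_rates(c, mean_z, k, alpha):
    """Return (total, primary, secondary) stationary rates.

    total     = c / E[Z_1]
    secondary = total * k / alpha   (aftershocks)
    primary   = total - secondary
    Requires the subcritical condition 0 <= k < alpha.
    """
    if c <= 0 or mean_z <= 0 or alpha <= 0 or not (0 <= k < alpha):
        raise ValueError("need c > 0, E[Z_1] > 0 and 0 <= k < alpha")
    total = c / mean_z
    secondary = total * k / alpha
    primary = total - secondary
    return total, primary, secondary
```

For example, `stationary_rates(1.0, 2.0, 0.5, 1.0)` gives a total rate of 0.5, shared equally between primary shocks and aftershocks.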

5 Two embeddings 

We now turn to the technical core of the paper, namely the proof of existence of a unique ergodic solution of the model, under the condition $k/\alpha < 1$ and a further condition on the distribution of $Z_1$ (see Condition (CZ) in Section 7). The technique used is that of Harris chains, and we start as usual by studying a natural embedded process. More precisely, let $\{t_n\}_{n \ge 0}$, with $t_0 = 0$, be the sequence of event times of $N$, and let for each $n \ge 0$, $T_{n+1} := t_{n+1} - t_n$, $X_n := X(t_n)$, $Y_n := Y(t_n)$. We then have the recurrence equations that exactly reflect the dynamics described in the previous section:

$$X_{n+1} = X_n + cT_{n+1} - Z_{n+1}$$

and

$$Y_{n+1} = Y_n e^{-\alpha T_{n+1}} + k,$$

where $T_{n+1}$ is a positive random variable whose hazard rate is, conditionally on $X_0, \ldots, X_n$, $Y_0, \ldots, Y_n$, $T_1, \ldots, T_n$ and $Z_1, \ldots, Z_n$,

$$s \mapsto \varphi(X_n + cs) + Y_n e^{-\alpha s}.$$






It is clear that $\{(X_n, Y_n)\}_{n \ge 0}$ is a homogeneous Markov chain. Its transition mechanism is fully described by the first transition, which can be implemented as follows. Let $X_0 = x$ and $Y_0 = y \ge 0$. On the positive half-plane with the time $t$ running on the horizontal coordinate axis, draw two curves:

(a) a curve with graph $(t, \varphi(x + ct))$ (that starts from $(0, \varphi(x))$);

(b) a curve with graph $(t, -ye^{-\alpha t})$ (that starts from $(0, -y)$).

Consider the projection on the time axis of the points of the Poisson field lying between the above two curves and let $T_1$ be the point of this projection with the smallest $t$-coordinate. It has, as the notation anticipated, the required hazard rate $\varphi(x + cs) + ye^{-\alpha s}$. In particular,

$$P(T_1 > t \mid X_0 = x, Y_0 = y) = P_{x,y}(T_1 > t) = e^{-\frac{y}{\alpha}(1 - e^{-\alpha t})} \cdot e^{-\int_0^t \varphi(x + cs)\, ds}$$

and

$$X_1 = x + cT_1 - Z_1, \qquad Y_1 = ye^{-\alpha T_1} + k.$$
Two lemmas concerning this particular realization of the transition kernel will be 
useful. 

Let $T_{x,y}$ be a "generic" random variable with distribution

$$P(T_{x,y} \in \cdot) = P(T_1 \in \cdot \mid X_0 = x, Y_0 = y).$$

Following the comments from above, one can represent $T_{x,y}$ as

$$T_{x,y} = \min\bigl(T^{(1,x)}, T^{(2,y)}\bigr) \tag{5}$$

where $T^{(1,x)}$ and $T^{(2,y)}$ are independent and

$$P(T^{(1,x)} > t) = e^{-\int_0^t \varphi(x + cv)\, dv}, \qquad P(T^{(2,y)} > t) = e^{-\frac{y}{\alpha}(1 - e^{-\alpha t})}.$$

Clearly,

$$P(T^{(2,y)} = \infty) = e^{-y/\alpha} > 0$$

for any $y \ge 0$.
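Representation (5) also gives a direct way of sampling one transition of the embedded chain, since both survival functions can be inverted. The Python sketch below is an illustration under an extra assumption that is not made in the text: it takes $\varphi = \exp$, for which the cumulative hazard of $T^{(1,x)}$ is $e^x(e^{ct} - 1)/c$. The names `sample_t2`, `sample_t1_exp` and `embedded_step` are ours.

```python
import math
import random


def sample_t2(y, alpha, rng):
    """Draw T^(2,y), with P(T > t) = exp(-(y/alpha) * (1 - exp(-alpha * t))).

    Returns math.inf with probability exp(-y / alpha)."""
    if y <= 0.0:
        return math.inf
    u = 1.0 - rng.random()                      # uniform on (0, 1]
    arg = 1.0 + (alpha / y) * math.log(u)
    return -math.log(arg) / alpha if arg > 0.0 else math.inf


def sample_t1_exp(x, c, rng):
    """Draw T^(1,x) when phi = exp (an assumption of this sketch):
    the cumulative hazard is exp(x) * (exp(c * t) - 1) / c."""
    e = rng.expovariate(1.0)                    # unit-mean exponential level
    return math.log1p(c * e * math.exp(-x)) / c


def embedded_step(x, y, c, k, alpha, draw_z, rng):
    """One transition (x, y) -> (X_1, Y_1) of the natural embedded chain,
    returned together with the inter-event time T_{x,y}."""
    t = min(sample_t1_exp(x, c, rng), sample_t2(y, alpha, rng))
    return x + c * t - draw_z(rng), y * math.exp(-alpha * t) + k, t
```

Since $T^{(1,x)}$ is a.s. finite, the minimum is finite even when the draw of $T^{(2,y)}$ is infinite.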

Lemma 2. (1) For any $0 \le y_1 < y_2$,

$$T^{(2,y_1)} \ge_{st} T^{(2,y_2)}. \tag{6}$$

(2) For any $x_1 < x_2$,

$$T^{(1,x_1)} \ge_{st} T^{(1,x_2)} \tag{7}$$

and

$$x_1 + cT^{(1,x_1)} \le_{st} x_2 + cT^{(1,x_2)}. \tag{8}$$

Also, for any $x$, $P(T^{(1,x)} < \infty) = 1$ and, moreover, for any $a > 0$,

$$P(T^{(1,x)} > t)\, e^{at} \to 0, \quad \text{as } t \to \infty.$$

Proof. Inequality (6) is straightforward. Inequality (7) follows from the monotonicity of $\varphi$, while inequality (8) follows from the following coupling construction. Let $t_0 > 0$ be such that $x_1 + ct_0 = x_2$. If there is a point of the Poisson field in $\{(t, u) : 0 \le t \le t_0,\ 0 \le u \le \varphi(x_1 + ct)\}$, then $T^{(1,x_1)} \le t_0$ and $x_1 + cT^{(1,x_1)} \le x_2$. If however there is no such point, then

$$x_1 + cT^{(1,x_1)} = x_2 + cT^{(1,x_2)} \circ \theta^{t_0},$$

where $\{\theta^t\}_{t \ge 0}$ is a family of measure-preserving shift transformations. So, with probability 1,

$$x_1 + cT^{(1,x_1)} \le x_2 + cT^{(1,x_2)} \circ \theta^{t_0} =_{st} x_2 + cT^{(1,x_2)}.$$

The remaining results follow from inequality (8) and from the fact that $\varphi(x) \to \infty$ as $x \to \infty$.



Corollary 1. For any $x_0$,

$$\sup_{x \ge x_0} ET_{x,y} \to 0 \quad \text{as } y \to \infty, \tag{9}$$

and

$$\sup_{y \ge 0} ET_{x,y} \to 0 \quad \text{as } x \to \infty. \tag{10}$$

Proof. By Lemma 2, $0 \le T_{x,y} \le_{st} T_{x_0,y}$ for any $x \ge x_0$. Clearly, $T^{(2,y)} \to 0$ in probability as $y \to \infty$. Since $ET^{(1,x_0)}$ is finite, the family of random variables $\{T_{x_0,y}\}_{y \ge 0}$ is uniformly integrable, and therefore

$$\sup_{x \ge x_0} ET_{x,y} = ET_{x_0,y} \to 0 \quad \text{as } y \to \infty.$$

Further, from (5), $0 \le T_{x,y} \le T^{(1,x)}$, where $T^{(1,x)} \to 0$ in probability as $x \to \infty$. By (7) and since $ET^{(1,0)}$ is finite, the family $\{T^{(1,x)}\}_{x \ge 0}$ is uniformly integrable, and therefore $ET^{(1,x)} \to 0$ as $x \to \infty$, and then (10) follows.



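Lemma 3. As $y \to \infty$,

$$yE\bigl(1 - e^{-\alpha T^{(2,y)}}\bigr) \to \alpha.$$

Proof. Indeed,

$$\begin{aligned}
yE\bigl(1 - e^{-\alpha T^{(2,y)}}\bigr) &= y \int_0^1 P\bigl(e^{-\alpha T^{(2,y)}} < v\bigr)\, dv \\
&= y \int_0^1 P\bigl(T^{(2,y)} > \ln v / (-\alpha)\bigr)\, dv \\
&= y \int_0^1 e^{-\frac{y}{\alpha}(1 - v)}\, dv \\
&= y \int_0^1 e^{-\frac{y}{\alpha} u}\, du \quad (\text{change of variables: } u = 1 - v) \\
&= \alpha \int_0^{y/\alpha} e^{-r}\, dr \quad (\text{change of variables: } r = yu/\alpha) \\
&\to \alpha,
\end{aligned}$$

as $y \to \infty$.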
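The last line of the computation gives the closed form $yE(1 - e^{-\alpha T^{(2,y)}}) = \alpha(1 - e^{-y/\alpha})$, which can be checked numerically against the integral in the third line. The Python sketch below is only a sanity check; the function names and the midpoint rule are our choices.

```python
import math


def lemma3_closed_form(y, alpha):
    """alpha * (1 - exp(-y / alpha)), the closed form of y * E(1 - exp(-alpha * T^(2,y)))."""
    return alpha * (1.0 - math.exp(-y / alpha))


def lemma3_midpoint(y, alpha, n=20000):
    """Midpoint-rule value of y * integral_0^1 exp(-(y/alpha) * (1 - v)) dv."""
    h = 1.0 / n
    total = sum(math.exp(-(y / alpha) * (1.0 - (i + 0.5) * h)) for i in range(n))
    return y * total * h
```

Evaluating `lemma3_closed_form(y, alpha)` for increasing `y` shows the convergence to `alpha` stated in the lemma.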

Remark. As follows from (5) and Lemma 2, if $x \ge 0$, then

$$T_{x,y} \le T^{(1,x)} \le_{st} T^{(1,0)}$$

and, therefore,

$$\sup_{x \ge 0,\, y \ge 0} ET_{x,y} \le ET^{(1,0)} < \infty. \tag{11}$$

One may also deduce from Lemma 2 that if $x < 0$, then

$$cT_{x,y} \le_{st} |x| + cT^{(1,0)} + Z,$$

where the random variables on the right-hand side are integrable. So one can find a universal constant $C > 0$ such that

$$ET_{x,y} \le C(|x| + 1), \quad \text{for all } x < 0 \text{ and } y \ge 0. \tag{12}$$

Then it follows from (11) and (12) that, for any negative $x_1$,

$$\sup_{x \ge x_1,\, y \ge 0} ET_{x,y} < \infty. \tag{13}$$

However, the supremum in (13) becomes infinite if one replaces $x_1$ by $-\infty$.

To keep the supremum finite, we consider a slightly different embedding. Again we describe the first transition only. We fix a sufficiently large positive $v_0$ and a sufficiently large negative $x_1$ (to be chosen in the next section) and define the new embedding $\{\widehat{T}_{x,y}\}$ as follows:

(a) if $x < x_1$, then $\widehat{T}_{x,y} = \min(T_{x,y}, v_0)$, while

(b) if $x \ge x_1$, then $\widehat{T}_{x,y} = T_{x,y}$.

Then clearly

$$\sup_{x \in (-\infty, \infty),\, y \ge 0} E\widehat{T}_{x,y} < \infty. \tag{14}$$

Denote by $(\widehat{X}_n, \widehat{Y}_n)$ a new time-homogeneous Markov chain obtained by the new embedding. It satisfies the following relations: given $\widehat{X}_0 = x$, $\widehat{Y}_0 = y$, if $x \ge x_1$, then

$$\widehat{X}_1 =_{st} x + cT_{x,y} - Z,$$

where $T_{x,y}$ and $Z$ are mutually independent, and

$$\widehat{Y}_1 =_{st} ye^{-\alpha T_{x,y}} + k,$$

and if $x < x_1$, then

$$\widehat{X}_1 =_{st} x + c\widehat{T}_{x,y} - Z\, I(T_{x,y} \le v_0),$$

where $T_{x,y}$ and $Z$ are mutually independent, and

$$\widehat{Y}_1 =_{st} ye^{-\alpha \widehat{T}_{x,y}} + k\, I(T_{x,y} \le v_0).$$
6 Positive recurrence of the embedded process 

In this section, we show positive recurrence of the Markov chain $(\widehat{X}_n, \widehat{Y}_n)$ obtained by the second embedding (see the end of the previous section). We recall some known facts.

Definition. Consider a discrete-time and time-homogeneous Markov chain $W_n$, $n \ge 0$, on a measurable state space $(\mathcal{W}, \mathcal{B}_{\mathcal{W}})$. A measurable set $V \subseteq \mathcal{W}$ is positive recurrent if the following two conditions hold:

(a) the random variable

$$\tau_w(V) := \min\{n \ge 1 : W_n \in V \mid W_0 = w\}$$

is a.s. finite, for all $w \in \mathcal{W}$;

(b) moreover,

$$\sup_{w \in V} E\tau_w(V) < \infty.$$

For a Markov chain $W_n$, the following result is known as Foster's criterion:

Proposition 1. Let $L : \mathcal{W} \to [0, \infty)$ be a measurable function, and let $\widehat{L}$ be a non-negative number. The set $V = \{w \in \mathcal{W} : L(w) \le \widehat{L}\}$ is positive recurrent if

(i) $\sup_{w \in V} E(L(W_1) \mid W_0 = w)$ is finite;

(ii) there exists $\varepsilon > 0$ such that

$$E(L(W_1) \mid W_0 = w) - L(w) \le -\varepsilon$$

for any $w \in \mathcal{W} \setminus V$.

Theorem 1. Let $k < \alpha$. For suitably chosen $v_0$ and $x_1$, there exists a compact set $V$ in $(-\infty, \infty) \times [0, \infty)$ which is positive recurrent for the Markov chain $(\widehat{X}_n, \widehat{Y}_n)$.

Remark. The sequence $(X_n, Y_n)$ is a subsequence of $(\widehat{X}_n, \widehat{Y}_n)$. With arguments similar to those of Theorem 1, one can also prove that the same set $V$ is positive recurrent for the Markov chain $(X_n, Y_n)$.

Proof of Theorem 1. We use Foster's criterion, with the following choice of the test function:

$$L(x, y) = L_1(x) + L_2(y) = \begin{cases} r_1 x + r_2 y, & \text{if } x \ge 0, \\ r_3 |x| + r_2 y, & \text{if } x < 0, \end{cases}$$

where $r_1, r_2, r_3$ are strictly positive (to be chosen later).
First of all, for any $C_1, C_2 > 0$,

$$\sup_{|x| \le C_1}\ \sup_{y \le C_2} E\bigl(L(\widehat{X}_1, \widehat{Y}_1) \mid \widehat{X}_0 = x, \widehat{Y}_0 = y\bigr) < \infty. \tag{15}$$

Indeed, let $r = \max(r_1, r_2, r_3)$. Then, for any $(x, y)$ from the set above,

$$c\widehat{T}_{x,y} \le_{st} cT^{(1,-C_1)} =: T$$

and $T$ has a finite mean. Therefore, for all $(x, y)$ from this set,

$$E\bigl(L(\widehat{X}_1, \widehat{Y}_1) \mid \widehat{X}_0 = x, \widehat{Y}_0 = y\bigr) \le r(C_1 + C_2 + ET + EZ + k) < \infty,$$

and (15) follows.

Now we impose several constraints on the coefficients $r_1, r_2, r_3$. First, we assume that

$$r_3 \le r_1 \tag{16}$$

and

$$r_1 EZ > r_2 k. \tag{17}$$

Let $\alpha - k = 2\Delta > 0$. We also assume that

$$r_2 \Delta > r_3 EZ. \tag{18}$$

In the proof, we use only conditions (16)-(18) which are, in particular, satisfied if

$$r_1 \gg r_2 \gg r_3 > 0.$$
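To make the constraints concrete, the following Python sketch produces one admissible triple of weights for given $k$, $\alpha$ and $EZ$, together with one third of the minimum of the two slacks $r_2\Delta - r_3 EZ$ and $r_1 EZ - r_2 k$. This is merely one possible choice, made for illustration; the function name `foster_weights` and the specific factors of 2 are ours and are not prescribed by the proof.

```python
def foster_weights(k, alpha, mean_z):
    """Return (r1, r2, r3, gamma) with r3 <= r1, r1*EZ > r2*k and
    r2*delta > r3*EZ, where delta = (alpha - k) / 2 and
    gamma = min(r2*delta - r3*EZ, r1*EZ - r2*k) / 3."""
    if not (0 <= k < alpha) or mean_z <= 0:
        raise ValueError("need 0 <= k < alpha and E[Z] > 0")
    delta = (alpha - k) / 2.0
    r3 = 1.0
    r2 = 2.0 * r3 * mean_z / delta        # r2*delta = 2*r3*EZ > r3*EZ
    r1 = max(r3, 2.0 * r2 * k / mean_z)   # r1 >= r3 and r1*EZ >= 2*r2*k
    gamma = min(r2 * delta - r3 * mean_z, r1 * mean_z - r2 * k) / 3.0
    return r1, r2, r3, gamma
```

With $k = 0.5$, $\alpha = 1$ and $EZ = 1$ this returns $r_1 = r_2 = 8$, $r_3 = 1$, so the ordering $r_1 \gg r_2 \gg r_3$ is sufficient but not necessary for the three constraints.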

Now we proceed to show that all the differences

$$E\bigl(L(\widehat{X}_1, \widehat{Y}_1) \mid (\widehat{X}_0, \widehat{Y}_0) = (x, y)\bigr) - L(x, y)$$

are bounded above by some negative constant if $(x, y)$ is outside the set $[x_1, x_0] \times [0, y_0]$, where $x_1$, $x_0$ and $y_0$ will be chosen in the proof. For this, consider separately two cases: (a) $x \ge 0$ and (b) $x < 0$.

Case $x \ge 0$.

In this case, the one-step embedding is the natural one, so we may write $(X_1, Y_1)$ instead of $(\widehat{X}_1, \widehat{Y}_1)$.

Let

$$3\gamma = \min\{r_2\Delta - r_3 EZ,\ r_1 EZ - r_2 k\} > 0. \tag{19}$$

Choose $x_0 > 0$ so big that

$$r_1 c\, ET^{(1,x_0)} \le \gamma \tag{20}$$

and

$$(r_1 + r_3) E(Z - x_0)^+ \le \gamma. \tag{21}$$

By Lemma 3, we may choose $y_0 > 0$ so large that



$$yE\bigl(1 - e^{-\alpha T^{(2,y)}}\bigr) - k \ge \tfrac{3}{2}\Delta, \quad \text{for all } y \ge y_0 \tag{22}$$

and that

$$r_1 c\, ET_{0,y_0} \le \gamma. \tag{23}$$

Then choose $v_0 > 0$ so large that

yE (l - e -«min(„ ,T( 2 .^))^ _ fc > for aU ^ > ^ ^ 

and that the following inequality holds:

$$\frac{r_3 c v_0}{2} e^{-y_0/\alpha} \ge \gamma + r_3 EZ + r_2 k. \tag{25}$$



Write for short

$$E_{x,y} L(X_1, Y_1) = E\bigl(L(X_1, Y_1) \mid X_0 = x, Y_0 = y\bigr) = E_{x,y} L_1(X_1) + E_{x,y} L_2(Y_1).$$

If $x \ge 0$, then

$$\begin{aligned}
E_{x,y} L_1(X_1) - L_1(x) &= r_1 E\bigl((x + cT_{x,y} - Z)\, I(x + cT_{x,y} - Z \ge 0)\bigr) \\
&\quad + r_3 E\bigl((-x - cT_{x,y} + Z)\, I(x + cT_{x,y} - Z < 0)\bigr) - r_1 x \\
&= cE\bigl(T_{x,y}\bigl(r_1 I(x + cT_{x,y} - Z \ge 0) - r_3 I(x + cT_{x,y} - Z < 0)\bigr)\bigr) \\
&\quad + r_1 E(x - Z) + (r_1 + r_3) E\bigl((-x + Z)\, I(x + cT_{x,y} - Z < 0)\bigr) - r_1 x \\
&\le r_1 c\, ET_{x,y} - r_1 EZ + (r_1 + r_3) E(Z - x)^+
\end{aligned}$$

and

$$E L_2(Y_1) - L_2(y) = r_2\bigl(yEe^{-\alpha T_{x,y}} + k\bigr) - r_2 y.$$

In particular, if $x \ge x_0$,

$$E_{x,y} L_1(X_1) - L_1(x) \le r_1 c\, ET^{(1,x_0)} - r_1 EZ + (r_1 + r_3) E(Z - x_0)^+$$

(where we used representation (5) and Lemma 2) and

$$E L_2(Y_1) - L_2(y) \le r_2 k,$$

so, in view of (19), (20) and (21),

$$E_{x,y} L(X_1, Y_1) - L(x, y) \le \gamma + \gamma - 3\gamma = -\gamma.$$

Furthermore, if $y \ge y_0$ and $0 \le x \le x_0$, then, by Lemma 2,

$$E_{x,y} L_1(X_1) - L_1(x) \le r_1 c\, ET_{0,y_0} + r_3 EZ$$

and, by inequality (22),

$$E L_2(Y_1) - L_2(y) \le -r_2 \Delta,$$

so

$$E_{x,y} L(X_1, Y_1) - L(x, y) \le r_1 c\, ET_{0,y_0} + r_3 EZ - r_2\Delta \le \gamma - 3\gamma < -\gamma,$$

by (19) and (23). So if $x \ge 0$ and if either $x > x_0$ or $y > y_0$, then

$$E_{x,y} L(X_1, Y_1) - L(x, y) \le -\gamma.$$

Case $x < 0$.

For the time being, fix any value of $x_1 < 0$. First, we observe that if $y \ge y_0$, where $y_0$ satisfies inequalities (22) and (23), then again the increments $E_{x,y} L(X_1, Y_1) - L(x, y)$ have a "uniformly" negative drift for all $x_1 \le x < 0$. Indeed, if $X_0 = x \in [x_1, 0)$, then, for any $y \ge 0$, $X_1 = x + cT_{x,y} - Z_1$ admits the following bounds:

$$|X_1|\, I(X_1 < 0) \le_{a.s.} (|x| + Z_1)\, I(X_1 < 0) \tag{26}$$

and

$$X_1 I(X_1 \ge 0) = \max(0, X_1) \le \max\bigl(0, \min(x + cT^{(1,x)}, x + cT^{(2,y)})\bigr). \tag{27}$$

From Lemma 2 and from independence of $T^{(1,x)}$ and $T^{(2,y)}$, we obtain

$$X_1 I(X_1 \ge 0) \le_{st} \min\bigl(cT^{(1,0)}, cT^{(2,y)}\bigr) = cT_{0,y}. \tag{28}$$

Therefore, for any $x_1 \le x < 0$,

$$\begin{aligned}
E_{x,y} L_1(X_1) &= r_1 E_{x,y}\bigl(X_1 I(X_1 \ge 0)\bigr) + r_3 E_{x,y}\bigl(|X_1|\, I(X_1 < 0)\bigr) \\
&\le r_1 E_{x,y}\bigl(X_1 I(X_1 \ge 0)\bigr) + r_3 E_{x,y}\bigl((|x| + Z_1)\, I(X_1 < 0)\bigr) \\
&\le r_1 c\, ET_{0,y} + r_3 |x| + r_3 EZ.
\end{aligned}$$

Since $y_0$ satisfies inequalities (22) and (23), we have, for all $x_1 \le x < 0$ and $y \ge y_0$,

$$E_{x,y} L(X_1, Y_1) - L(x, y) \le r_1 c\, ET_{0,y_0} + r_3 EZ - r_2\Delta \le -\gamma.$$

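We now choose $x_1 \ll -1$ so large in absolute value that the increment of $L(\widehat{X}, \widehat{Y})$ has a uniformly negative drift for all $x < x_1$. We start with the assumption that

$$x_1 < -cv_0. \tag{29}$$

Therefore, if $\widehat{X}_0 = x < x_1$ and $\widehat{Y}_0 = y$, then

$$L_1(\widehat{X}_1) = r_3\bigl(-x - c\widehat{T}_{x,y} + Z\, I(T_{x,y} \le v_0)\bigr) \le r_3\bigl(-x - c\widehat{T}_{x,y} + Z\bigr)$$

and

$$L_2(\widehat{Y}_1) = r_2\bigl(ye^{-\alpha \widehat{T}_{x,y}} + k\, I(T_{x,y} \le v_0)\bigr) \le r_2\bigl(ye^{-\alpha \widehat{T}_{x,y}} + k\bigr).$$

We impose two additional constraints on $x_1$, making it even more negative. Since $\varphi(x) \to 0$ as $x \to -\infty$, one can choose $x_1 < -cv_0$, $x_1 \ll -1$, such that

$$e^{-\int_0^{v_0} \varphi(x_1 + cv)\, dv} \ge \frac{1}{2}. \tag{30}$$

Secondly, it follows that $T^{(1,x)} \to \infty$ in probability as $x \to -\infty$, and therefore, from (24), one can choose $x_1 < -cv_0$ such that

$$yE\bigl(1 - e^{-\alpha \widehat{T}_{x,y}}\bigr) - k \ge \Delta, \quad \text{for all } y \ge y_0 \text{ and } x \le x_1. \tag{31}$$

Assume that $x_1$ satisfies all three conditions (29)-(31). If $y \ge y_0$ then, for any $x < x_1$,

$$\begin{aligned}
E_{x,y} L(\widehat{X}_1, \widehat{Y}_1) - L(x, y) &\le r_3\bigl(-cE\widehat{T}_{x,y} + EZ\bigr) + r_2\bigl(yEe^{-\alpha \widehat{T}_{x,y}} + k\bigr) - r_2 y \\
&\le -r_2\Delta + r_3 EZ \le -\gamma,
\end{aligned}$$

by (19) and (31).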

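If instead $y < y_0$ and $x < x_1$, then

$$P(T_{x,y} > v_0) \ge P(T_{x,y_0} > v_0) \ge \tfrac{1}{2} e^{-y_0/\alpha},$$

since the random variables $T_{x,y}$ are stochastically decreasing in $y$ (again by Lemma 2). Therefore, for $\widehat{T}_{x,y} = \min(v_0, T_{x,y})$,

$$E\widehat{T}_{x,y} \ge v_0 \cdot P(T_{x,y} > v_0) \ge \frac{v_0}{2} e^{-y_0/\alpha}, \tag{32}$$

and

$$\begin{aligned}
E_{x,y} L(\widehat{X}_1, \widehat{Y}_1) - L(x, y) &\le r_3\bigl(-cE\widehat{T}_{x,y} + EZ\bigr) + r_2\bigl(yEe^{-\alpha \widehat{T}_{x,y}} + k\bigr) - r_2 y \\
&\le -\frac{r_3 c v_0}{2} e^{-y_0/\alpha} + r_3 EZ + r_2 k \le -\gamma,
\end{aligned}$$

due to (25).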

As an outcome, we have that if $y_0$ satisfies conditions (22)-(23), if $v_0$ satisfies (24)-(25), and if $x_1$ satisfies (29)-(31), then the increments of $EL(\widehat{X}_1, \widehat{Y}_1)$ have a drift bounded above by $-\gamma$ for all initial values such that either $x < x_1$, or $x < 0$ and $y > y_0$.

The set

$$V = [x_1, x_0] \times [0, y_0]$$

is therefore positive recurrent for the Markov chain $(\widehat{X}_n, \widehat{Y}_n)$.

Also, as follows from the classical proof of Foster's criterion, for any initial value $(\widehat{X}_0, \widehat{Y}_0) = (x, y)$, the random variable

$$\tau_{x,y}(V) = \min\{n \ge 1 : (\widehat{X}_n, \widehat{Y}_n) \in V \mid (\widehat{X}_0, \widehat{Y}_0) = (x, y)\}$$

is almost surely finite and, moreover, there exists an absolute constant $C > 0$ such that

$$E\tau_{x,y}(V) \le C(L(x, y) + 1),$$

for all $(x, y)$ (see, e.g., [11] or [8]). The proof of Theorem 1 is complete.

7 Harris ergodicity 

We recall the following classical result (see for instance [11]).

Proposition 2. Assume that a Markov chain $W_n$, $n \ge 0$, taking values in a measurable space $(\mathcal{W}, \mathcal{B}_{\mathcal{W}})$ is aperiodic and that there exists a positive recurrent set $V$ that admits a minorant measure, i.e. there exist a positive integer $m$, a positive $p \le 1$ and a probability measure $\mu$ such that

$$P(W_m \in \cdot \mid W_0 = w) \ge p\mu(\cdot) \quad \text{for all } w \in V. \tag{33}$$

Then the Markov chain is Harris ergodic, which means that there exists a unique stationary distribution (say $\pi$) and that, for any initial value $W_0 = w$, the distributions of $W_n$ converge to the stationary one in the total variation norm,

$$\sup_{B \in \mathcal{B}_{\mathcal{W}}} |P(W_n \in B) - \pi(B)| \to 0, \quad n \to \infty.$$

In practice, the most technical part in applying this criterion is to verify aperiodicity. There are a number of sufficient conditions available for the Markov chain to be aperiodic and Harris ergodic.

We mention two of them. The most common is the following condition. 

Sufficient condition 1 (SC1). A Markov chain $W_n$ is Harris ergodic if there exists a positive recurrent set $V$ such that condition (33) holds with $m = 1$ and with $\mu$ such that $\mu(V) > 0$.

However, in our proof, it seems to be easier to verify another — slightly more general 
— sufficient condition. 

Sufficient condition 2 (SC2). A Markov chain $W_n$ is Harris ergodic if there exists a positive recurrent set $V$ such that condition (33) holds with a finite number of different values of $m$, say $m_i$, $i = 1, 2, \ldots, k$, which are such that

$$\text{g.c.d.}\{m_i,\ 1 \le i \le k\} = 1.$$

We will apply condition (SC2) with $k = 2$ and with $m_1 = 2$ and $m_2 = 3$. For that, we introduce a condition on the distribution of $Z$ which leads to (SC2).

Condition (CZ). There exist $0 \le z_1 < z_2 < \infty$ such that, for some $h > 0$ and for any $[u_1, u_2] \subseteq [z_1, z_2]$,

$$P(Z \in [u_1, u_2]) \ge h(u_2 - u_1).$$

In other words, the distribution of $Z$ has an absolutely continuous (with respect to Lebesgue measure) component whose density function is above level $h$ everywhere in the interval $[z_1, z_2]$.

Theorem 2. Assume condition (CZ) to hold. Then the Markov chain $(X_n, Y_n)$ is Harris ergodic.

Proof. We may assume without loss of generality that $z_2 - z_1 < x_0 - x_1$. Let $y_1 = y_0 + k$ and

$$\widehat{x}_0 = \inf\{x \ge x_0 : \varphi(x) > 0\} + z_2 + 2,$$

and let

$$V_1 = [x_1, \widehat{x}_0] \times [0, y_1],$$

so $V \subseteq V_1$. Then, for any $(x, y) \in V$, $P_{x,y}(Y_1 \le y_1) = 1$, so, by Lemmas 2-3,

$$\begin{aligned}
P_{x,y}((X_1, Y_1) \in V_1) &= P_{x,y}(X_1 \in [x_1, \widehat{x}_0]) \\
&\ge P\bigl(x + cT_{x,y} \in [\widehat{x}_0 + z_1 - 1, \widehat{x}_0 + z_1],\ Z_1 \in [z_1, z_2]\bigr) \\
&\ge P\bigl(x_1 + cT^{(1,x_1)} \in [\widehat{x}_0 + z_1 - 1, \widehat{x}_0 + z_1]\bigr) \\
&\quad \times P\bigl(cT^{(2,y_0)} > \widehat{x}_0 + z_1 - x_1\bigr)\, P(Z_1 \in [z_1, z_2]).
\end{aligned}$$

Denote by $R_0$ the value of the rightmost side of the above inequality (note that it is positive). Then, for any $(x, y) \in V$,

$$P_{x,y}((X_1, Y_1) \in V_1) \ge R_0 > 0.$$

Take some small positive $\varepsilon < (z_2 - z_1)/4$ (to be specified later). Choose $t_2 > 0$ so large that $x_2 := x_1 + ct_2 > \widehat{x}_0 + z_2$ and $y_1 e^{-\alpha(x_2 - \widehat{x}_0)} < \varepsilon$. Let $b = \varphi(x_2 - z_2)$ and note that $b > 0$. Then, for any $(x, y) \in V_1$,

$$P\bigl(x + cT_{x,y} \in [x_2, x_2 + \varepsilon],\ ye^{-\alpha T_{x,y}} \le \varepsilon\bigr) \ge b\varepsilon\, P\bigl(T^{(1,x_1)} > (x_2 - x_1)/c\bigr)\, P\bigl(T^{(2,y_1)} > (x_2 - x_1)/c\bigr).$$

Denote by $R_1$ the right-hand side of the inequality above (which is a positive number). Then, for any $(x, y) \in V_1$ and for $(X_1, Y_1) = (x + cT_{x,y} - Z_1, ye^{-\alpha T_{x,y}} + k)$,

be 

P Xi y((X 1: Yl) G [x 2 -Z!-e,x 2 - zi] x [k, k + e]) > Ri =: R 2 > 0. 

z 2 — Z\ 

Let

$$\widetilde{V} = [x_2 - z_1 - \varepsilon, x_2 - z_1] \times [k, k + \varepsilon].$$

From the construction above, one may conclude that

$$\inf_{(x,y) \in V} P_{x,y}\bigl((X_1, Y_1) \in \widetilde{V}\bigr) \ge R_2 > 0 \tag{34}$$

(since $V \subseteq V_1$) and then that, by the Markov property, for any $(x, y) \in V$,

$$P_{x,y}\bigl((X_2, Y_2) \in \widetilde{V}\bigr) \ge R_0 \cdot \inf_{(x,y) \in V_1} P_{x,y}\bigl((X_1, Y_1) \in \widetilde{V}\bigr) \ge R_0 R_2 > 0. \tag{35}$$

Now take $\varepsilon > 0$ so small that one can choose positive numbers $t_3$ and $t_4$ such that $t_4 > t_3 > z_2$, that

$$k_2 := ke^{-\alpha t_3} > (k + \varepsilon)e^{-\alpha t_4} =: k_1,$$

and that

$$\delta := \varepsilon + c(t_4 - t_3) < \frac{z_2 - z_1}{2}.$$

Then, for any $y \in [k, k + \varepsilon]$, we have the inclusion $[k_1, k_2] \subseteq [ye^{-\alpha t_4}, ye^{-\alpha t_3}]$.

For any $(x, y) \in \widetilde{V}$ denote by $g_{x,y}(u)$ a density function of the random variable $ye^{-\alpha T_{x,y}}$ (which clearly has an absolutely continuous distribution). Then direct computations show that

$$c_0 := \inf_{(x,y) \in \widetilde{V}}\ \inf_{u \in [k_1, k_2]} g_{x,y}(u) > 0. \tag{36}$$



Indeed, let 



• f m(l + t) , . , . , P(a < T x y < b) 
C\ = mt and c 2 = mi mi - . 

0<t<k 2 /k 1 t (x,y)eV >ln ( fc / fc 2)<a<b<ln((fc+£)/fci) — d 

Then both c\ and c 2 are positive and, for [a, b] C [kx, k 2 ], 

P(ye-^' e [a,*]) = P (*ML < T,„ < ^M) > £l£i (t _ a) , 

\ a a J a 

so (36) holds with $c_0 = c_1 c_2 / \alpha$.

Furthermore, let $x_3 = x_2 - z_1 - \varepsilon + ct_3$. Note that if $(x, y) \in \widetilde{V}$ and $T_{x,y} \in [t_3, t_4]$, then $x + cT_{x,y} \in [x_3, x_3 + \delta]$. Then, by condition (CZ), given $x + cT_{x,y} = v \in [x_3, x_3 + \delta]$, the random variable $v - Z_1$ has an absolutely continuous component with a uniform distribution on the interval $[x_3 - (z_1 + z_2)/2, x_3 - z_1]$.

We may therefore conclude that, for any $(x, y) \in \widetilde{V}$,

P^Xi, Y 1 ) G •) > 2(^2 - Si)" 1 /^ V(0 (37) 

where $\mu$ is the two-dimensional uniform distribution on the rectangle $V_2 := [x_3 - (z_1 + z_2)/2, x_3 - z_1] \times [k_1, k_2]$, the coefficient $h$ is from condition (CZ), and $c_0$ is from (36).

It follows from inequalities (34), (35), and (37) that condition (SC2) is satisfied with $k = 2$ and with $m_1 = 2$ and $m_2 = 3$, and this completes the proof.

Corollary 2. Assume again that $k < \alpha$ and that condition (CZ) holds. Consider the Markov chain $(X_n, Y_n)$ and let $0 \le S_1 < S_2 < \cdots < S_k < \ldots$ be the consecutive times of the ends of discrete-time "cycles" at which the random vectors $(X_{S_k}, Y_{S_k})$ have the uniform distribution on the rectangle $V_2$. Then the random variables $l_k = S_k - S_{k-1}$, $k \ge 2$, are i.i.d. with a finite mean.

A proof of this result can be found, for instance, in [1].



8 Stability in continuous time 

Theorem 3. Under condition (CZ),

(1) there exists a unique stationary version of the continuous-time Markov process $(X(t), Y(t))$ (which is also ergodic);

(2) for any initial value $X(0) = X_0 = x$, $Y(0) = Y_0 = y$, the process $(X(t), Y(t))$ converges to the stationary one in the total variation norm.

Proof. Consider again the embedded Markov chain $(X_n, Y_n)$ and its cycles of length $l_k$. Then the corresponding cycles in continuous time are defined as

$$L_k = \sum_{i = S_{k-1} + 1}^{S_k} T_{X_i, Y_i}, \quad k = 1, 2, \ldots,$$

which are again i.i.d. for $k \ge 2$.

Then a proof of the theorem follows from the two results below, Statement 1 and Statement 2 (see, for instance, [1], Proposition 3.8, p. 203 or [2], Section 7 or [3], Chapter 3).

Statement 1. The distribution of the random variables $L_k$, $k \ge 2$, has an absolutely continuous component with a density function which is separated from 0 on a finite time interval of positive length.

Statement 2. $EL_2$ is finite.

Statement 1 may be verified directly using arguments similar to those in the previous section. Furthermore, since $C := \sup_{x,y} ET_{x,y} < \infty$,

$$EL_2 \le C\, El_2 < \infty,$$

and the result follows.